Duty cycle corrector circuits

ABSTRACT

Duty cycle corrector circuits that utilize differing unit delay elements in their delay lines in either a graduated or a stepped unit time delay arrangement are for synchronizing with a clock signal. These graduated or a stepped unit time delays allow reduction in the number of the fine unit delay elements of the delay lines by placing a fine delay element granularity at the most critical timings to sense and adjust for the portion of the clock signal time period that are high speed or critical.

RELATED APPLICATIONS

This application is a continuation of U.S. patent application Ser. No. 12/901,811 (allowed), filed Oct. 11, 2010 and titled “METHODS AND APPARATUS FOR SYNCHRONIZING WITH A CLOCK SIGNAL,” which is a divisional of U.S. patent application Ser. No. 12/345,039 (now U.S. Pat. No. 7,812,657), filed Dec. 29, 2008 and titled “METHODS AND APPARATUS FOR SYNCHRONIZING WITH A CLOCK SIGNAL,” which is a continuation of U.S. patent application Ser. No. 11/132,502 (now U.S. Pat. No. 7,471,130), filed May 19, 2005 and titled “GRADUATED DELAY LINE FOR INCREASED CLOCK SKEW CORRECTION CIRCUIT OPERATING RANGE,” which are commonly assigned and incorporated by reference in their entirety herein.

TECHNICAL FIELD

The invention relates to integrated circuits, and more particularly, to clock synchronization and skew adjustment circuits.

BACKGROUND

In modern integrated circuits and computer systems, increasing clock speed and high speed signal transitions are resulting in an increased need for accurate signal timings, communications, and system clock synchronization to allow for proper operation. For example, memory access speed and the resulting data transfer bandwidth has been a typical bottleneck in computer systems and other digital applications. A newer type of dynamic random access memory (DRAM), known as a synchronous DRAM or SDRAM, has been developed to provide faster operation and improve memory access times. SDRAMs are designed to operate synchronously with the system clock with input and output data synchronized to an active edge of the system clock which is driving the processor accessing the SDRAM.

Although SDRAMs have overcome some of the timing disadvantages of other memory devices memory, access is still a limiting factor, and there is a need for still faster memory devices. With this in mind, double data rate (DDR) SDRAMs were developed to allow data transfers on both the rising and falling edges of the system data clock, providing twice the operating speed of the conventional SDRAM. Thus, DDR SDRAM provides up to twice as much data bandwidth as the conventional SDRAM for a given data clock. In addition, as with SDRAM, DDR SDRAMs are also capable of providing bursts of data at a high-speed data rate. It is noted that other synchronous memory types, including, but not limited to quad data rate (QDR), synchronous graphic DRAM (SGDRAM), DDR II SDRAM, and Rambus memory standards. It is further noted that other memory types, memory busses and memory interfaces, including, but not limited to, video RAM (VRAM), static RAM (SRAM), read only memory (ROM), electrically erasable programmable ROM (EEPROM), and Flash memory with both synchronous and asynchronous interfaces, are also well known in the art.

As system and integrated circuit clock frequencies increase, it is desirable to have less uncertainty in signal timings. For example, one such critical timing in modern computer systems is when valid data is available on the output of memory devices. In modern integrated circuits, clock synchronization and skew adjustment circuits (also known as clock recovery and/or duty cycle correction circuits) are commonly utilized to reduce this signal timing uncertainty by synchronizing with clock signals and/or other periodic signals and adjusting the signal delay and/or conditioning the signal itself. Such clock synchronization and skew adjustment circuits include, but are not limited to the digital delay locked loop (DLL), the synchronous mirror delay (SMD), and the duty cycle corrector (DCC). These circuits typically utilize large delay lines of multiple sequentially coupled delay elements to achieve their goal of synchronizing with and/or adjusting the duty cycle of an input periodic signal, such as a system clock. Because of this, the circuits can consume a large area of the integrated circuit die and consume significant amounts of power while in operation. In addition, as clock frequencies increase and signal timings become increasingly critical, the need for finer resolution in these clock synchronization and skew adjustment circuits is increasing. This in turn typically requires an increase in the number of the delay elements and a decrease in their individual time delay to achieve the required granularity in the delay lines utilized by these circuits. This is particularly an issue in integrated circuits that must operate over a wide range of clock frequencies in that the granularity of the delay lines and delay elements must be sized for the required resolution of the highest clock frequencies and yet be long enough to operated with/contain the slowest signals of the specified range of frequencies within the delay line.

For the reasons stated above, and for other reasons stated below which will become apparent to those skilled in the art upon reading and understanding the present specification, there is a need in the art for a system to reduce clock synchronization and skew adjustment circuit size in integrated circuits and memory devices while maintaining adequate resolution.

BRIEF DESCRIPTION OF THE DRAWINGS

FIG. 1 details a SMD circuit in accordance with embodiments of the present invention.

FIG. 2 details a DCC circuit in accordance with embodiments of the present invention.

FIG. 3 details a DLL circuit in accordance with embodiments of the present invention.

FIGS. 4 and 5 detail stepped and graduated delay curves in accordance with embodiments of the present invention.

FIGS. 6, 7A, 7B, 8A and 8B detail lock curves of feedback based clock synchronization circuit/DLL circuits in accordance with embodiments of the present invention.

FIG. 9 details a system with a memory device in accordance with embodiments of the present invention.

DETAILED DESCRIPTION

In the following detailed description of the preferred embodiments, reference is made to the accompanying drawings that form a part hereof, and in which is shown by way of illustration specific preferred embodiments in which the inventions may be practiced. These embodiments are described in sufficient detail to enable those skilled in the art to practice the invention, and it is to be understood that other embodiments may be utilized and that logical, mechanical and electrical changes may be made without departing from the spirit and scope of the present invention. The following detailed description is, therefore, not to be taken in a limiting sense, and the scope of the present invention is defined only by the claims.

Clock synchronization and skew adjustment circuits of embodiments of the present invention utilize differing unit time delay elements in the delay lines in either a graduated or a stepped unit time delay arrangement, allowing a reduced circuit implementation and improved clock frequency lock characteristics. The stepped or graduated delay lines allow for reduced circuit size while maintaining a high resolution by placing a fine delay element granularity at the most critical timings for accurate signal lock. Integrated circuits and memory devices utilizing clock synchronization and skew adjustment circuits in accordance with embodiments of the present invention utilize time delays with either a graduated or a stepped time delay arrangement in their delay lines. These graduated or a stepped unit time delays allow reduction of the number of fine unit delay elements of the delay lines to only those necessary to sense and adjust for the high frequency signals or that portion of the clock signal time period that is critical for operation of the integrated circuit or memory device. The described invention allows clock synchronization and skew adjustment circuits to be implemented in an optimized manner that exhibits a reduced overall circuit size and power consumption, while improving lock characteristics over a wide range of frequencies.

As stated above, in modern integrated circuits, commonly utilized clock synchronization and skew adjustment circuits include, but are not limited to the digital delay locked loop (DLL), the synchronous mirror delay (SMD), and the duty cycle corrector (DCC).

DLL circuits generally consist of a delay line of individual delay elements, a shift register, phase comparator, and an associated control circuit. The DLL generates a delayed clock signal via a delay line, typically controlled with a shift register. The delay line consists of serially connected delay elements. The output from the delay line is then compared with the DLL feedback signal, and this variable delay is iteratively adjusted in the shift register by the control circuit of the DLL until a match or “lock” with the input clock signal is achieved. When the circuit containing the DLL is powering up or coming out of a low power mode it is not uncommon for 50 or more clock cycles to pass until the feedback based DLL attains a lock with the external clock, delaying operation of the integrated circuit or initial access to a memory device. In many cases the output clock signal is also adjusted for the fixed delays of the integrated circuit input clock signal path and the output path (also known as the data path) of the generated clock signal.

In contrast, a SMD circuit also generates a replica of an input clock signal, but is not considered an iterative or feedback based design. Because of this design approach, SMD circuits can attain a lock with the external clock signal in as few as two clock cycles. This ability greatly decreases the time to first access when an integrated circuit, such as a DDR SDRAM, first powers up or comes out of a low power mode. A SMD circuit generally includes a data path model delay (replicating the input buffer and output path delays), two delay lines and an accompanying control circuit. The delay lines and control circuit must be of a length sufficient to accommodate the full time period of the longest clock pulse to be matched. Because of this, an SMD circuit generally requires more layout space to implement on the integrated circuit than a DLL. In operation, an input clock signal pulse transitions through the data path model and enters the first delay line to transition through the unit delay elements. The following input clock pulse triggers the control circuit to load the contents of the first delay line into the second delay line. The loaded clock signal then becomes the basis of the internally generated clock signal as it is transitioned back out of the second delay line.

A related circuit to the DLL and SMD clock synchronization circuits is the duty cycle corrector (DCC), which is utilizes two or more delay lines to adjust or restore the proper timing and pulse width to an input signal. A DCC circuit generally includes a phase detector, and two or more sequentially coupled delay lines, where the total length of the delay lines must be of a length sufficient to accommodate the full time period of the longest clock pulse to be corrected. In operation, the DCC typically utilizes only one rising or falling edge of the clock signal and generates the other at the appropriate placement to adjust or restore the proper duty cycle of the input signal. The input clock signal pulse enters the first delay line to transition through the unit delay elements. Upon exiting the end of the first delay line the signal enters the second delay line. The output of the second (final) delay line, which should be approximately one clock signal (360°) off the incoming input signal, is then compared against the following input clock pulse by the phase detector. The phase detector then adjusts taps on the delay lines until a match occurs and the clock is “locked in.” Since the same control signals are fed from the phase detector to both of the delay lines, they remain in a fixed ratio to each other that is dependent on individual lengths of the delay lines. The fixed ratio of the first delay line to the second delay line means that their outputs can be utilized to restore the original duty cycle of a signal or to adjust the duty cycle to whatever ratio is desired. It is also noted that additional delay lines can be added in series within the sequence of the first and second delay lines to generate other signals, such as clock signals that are phase shifted at selected ratios from the input or higher frequency signals (N*tck) that are generated from and synchronized with the input clock signal (tck).

As stated above, in the clock synchronization and skew adjustment circuits of embodiments of the present invention, a mixture of lower time period fine delay elements and larger time period coarse delay elements are utilized that are stepped or graduated in their individual time delays to reduce the total number of delay elements used in the delay line. In this approach the fine delay elements are utilized to cover the high frequency and/or critical timing regions of the range of input clock signals, while the coarse delay elements are utilized in less timing critical or regions of the delay lines utilized by slower frequency clock signals. This allows for a high resolution and fine adjustment capability to be used in critical timing areas and coarser resolution in frequency ranges to be used where we are operating at slower frequency or out of specification and are thus by definition less critical. This also allows for a wider range of clock cycle times to be covered by a given clock synchronization and skew adjustment circuit with fewer frequency range crossover issues. In addition, the use of a stepped or graduated delay line of fine and coarse delay elements allows for a reduction in the total number of delay elements used, reducing the overall circuit size, layout complexity and power usage. Reducing the total number of delay elements may also reduce the size of the control circuitry of the clock synchronization and skew adjustment circuit, further reducing the overall circuit size, layout complexity and power usage.

The specification for jitter and noise in most clock synchronization and skew adjustment circuits is specified in the time to data access (t_(ac)) for the integrated circuit or memory device it is contained in. Because of the reduced number of delay elements, reduced control circuitry, reduced number of driven signal lines and reduced power usage, clock synchronization and skew adjustment circuits of embodiments of the present invention also reduce the amount of generated voltage noise and lock jitter. In addition, the improved resolution at critical timings or frequencies allow embodiments to exhibit an increased accuracy frequency lock at these timings, while the reduced number of total delay elements reduce the time to initial lock by reducing the total number of possible lock iterations.

Delay elements typically contain active devices, such as NAND gates, inverters, other logic gates or amplifiers. Delay element designs include, but are not limited to, a NAND gate coupled to an inverter, two series coupled NAND gates, two series coupled inverters. Additional delay element structures or gate delays, such as, skew limited delays and RC delays, will be apparent to those skilled in the art with the benefit of the present specification. The time delay of individual delay elements can also typically be increased by increasing the size of the channel widths of the transistors they are composed of, or, in the case of the RC delay, by increasing the capacitance or resistance of the delay element.

Prior art delay lines typically contain 100 or more of these delay elements. As most common delay elements contain one or more active devices any signal noise that is inserted in the delay line will typically get amplified as it progresses through the delay line. Because of these active devices in the delay elements, the error due to the amplified noise will tend to get worse the deeper it progresses through the delay line, increasing lock jitter and synchronization error. Thus, in addition to reducing the overall power usage and generated voltage noise, by reducing the number of active elements the error is amplified through clock synchronization and skew adjustment circuit embodiments of the present invention that utilize reduced length stepped or graduated delay lines also reduce their susceptibility to lock jitter and synchronization error.

In designing the graduated delay lines of clock synchronization and skew adjustment circuits of embodiments of the present invention, both the integrated circuit application and operational characteristics of the particular clock synchronization and skew adjustment circuit being used should be considered. In general, there are two main architecture types that are utilized in modern clock synchronization and skew adjustment circuits to be considered in designing delay line embodiments of the present invention; feedback based circuits that can contain multiple clocks in the feedback loop (wrap-around) and feedback and non-feedback based circuits that will not contain multiple clocks within their delay lines during operation.

In non-feedback and non-wrapping feedback based clock synchronization and skew adjustment circuit embodiments of the present invention (circuits that do not internally wrap-around the clock signal and go to multiple clock signals within the delay lines at higher frequencies when they run out of delay elements in their delay lines), such as SMD and DCC circuits, the graduating or stepping to coarser/lower resolution delay elements can begin as soon as the necessary fine resolution is achieved at the critical timings or frequency ranges. This stepping or graduating is typically done according to a resolution de-rating table or curve specified by the designer so as to achieve acceptable resolution at the specified operating frequency ranges of the integrated circuit it is utilized in. It is noted that in many cases, once an integrated circuit or memory device is being operated outside of the frequency range of its design specification the signal timings are out of specification anyways and therefore the accuracy of the frequency lock is, by definition, less critical. It is also noted that clock synchronization and skew adjustment circuit embodiments of the present invention can be designed so that they contain multiple regions of critical timing or frequency ranges and thus the stepping or graduation of delay elements in the delay lines of these embodiments may contain multiple regions of increased fine or coarse delay elements at the critical or non-critical regions of the delay line. It is further noted that these regions of increased fine or coarse delay elements can be progressively stepped or graduated up or down into.

In clock synchronization and skew adjustment circuit embodiments of the present invention that internally wrap-around the clock signal, such as DLL circuits, graduating or stepping to coarser/lower resolution delay elements must take into consideration the “lock curve” of the clock synchronization and skew adjustment circuit. The lock curve is a sawtooth shaped graph of the number of delay elements that will be utilized to lock at a given frequency. In this lock curve, the number of delay elements reduces to the minimum possible each time the clock signal is wrapped around internally (so that an additional clock cycle is held within the delay lines) to lock to a higher frequency clock signal range. In these circuits, the stepping or graduation of the delay line to coarser delay elements is preferably designed so that the stepping or graduation does not affect higher frequency sections of the lock curve by stepping or graduating too early and inadvertently affecting the higher frequency/higher integer clock cycle saw-tooth sections of the lock curve.

FIG. 1 details a SMD circuit 112 of an embodiment of the present invention. The SMD circuit 112 of FIG. 1 includes a clock input buffer 110, a data path model (DPM) 100, forward delay line (FDL) 102, a SMD control circuit 104, a backward delay line (BDL) 106, and the real data path (RDP) element 108. The RDP 108 symbolically represents the real data path circuitry delay added by internal circuitry of the system, such as clock drivers and other logic delays that the SMD circuit of FIG. 1 is implemented into and is not a real circuit component. The FDL 102 and BDL 106 are comprised of delay elements (not shown) that have a stepped or graduated unit delay. The input buffer 110 and real data path 108 add additional internal delays for the SMD circuit, d_(in) for the input buffer 110 and delay of d_(dp) for the RDP 108, that are compensated for by the DPM 100.

The SMD circuit 112 receives an external clock signal through the input buffer 110, which couples the received clock signal to the SMD control 104 and the DPM 100. The received clock signal is passed through the DPM circuit 100, which has a delay of d_(in)+d_(dp), replicating the delays of input buffer 110 and real data path 108 in order to account for them in the recovery of the clock signal. The clock signal is then passed to the forward delay line (FDL) 102, where it is passed sequentially from delay element to delay element. The combined delay of the DPM 100 and the FDL 102, is such that the forward edge of the received clock signal is still within the FDL 102 when the next external clock signal is received at the input buffer 110. Upon receiving the next external clock signal the input buffer 110 again couples it to the SMD control 104 and the DPM 100 circuits. The next external clock signal enters the DPM circuit 100 and the FDL 102 chain to become the basis of the next recovered clock signal. The next external clock signal also triggers the SMD control 104, which transfers the current clock signal from the position it has attained in the FDL 102 delay chain to the same position in the BDL 106 delay chain. The next external clock signal enables the BDL 106 and the initial clock signal then flows backwards through the BDL 106, which is the same length as the FDL 102 and composed of the same stepped or graduated unit delay elements. Because the BDL 106 is the same length as the FDL 102 the clock signal will take the same amount of time to transition through it and will therefore exactly replicate the input clock signal. The additional delay factors of the input buffer 110 and clock driver 108 will already be taken into account in the clock signal loaded into the BDL 106 by the pre-delay of the DPM 100, that effectively subtracts the delay factors from the time period measured by the FDL 102. The recovered and skew adjusted clock signal is coupled from the BDL 106 to provide a synchronous clock for the circuit that the SMD circuit of FIG. 1 is implemented into, the delay of which is represented by the real data path 108.

FIG. 2 details a DCC circuit 200 of an embodiment of the present invention that has been designed to adjust an input clock signal back to a 50% duty cycle. The DCC circuit 200 contains two sequentially coupled delay lines 210, 212 and a phase detector 214. Because the DCC circuit 200 of FIG. 2 corrects the input clock signal back to a 50% duty cycle, it contains two sequentially coupled delay lines 210, 212 which are equal in length. In operation, a clock signal enters the DCC on the clock input 202 and coupled to the phase detector 214, the first delay line 210, and a 0° clock output 204. The clock signal propagates through the first delay line 210 and, upon its output, is coupled to the input of the second delay line 212 and an intermediate 180° clock output 206. The clock signal then propagates through the second delay line 212 and is coupled upon output to the phase detector 214 and a 360° clock output 208. The phase detector 214 then compares the 360° clock output from the second delay line 212 to the next incoming clock signal (the 0° clock) received at the clock input 202 and iteratively adjusts the lengths of both the first and second delay lines equally, selecting differing output taps, until the 0° clock and the 360° clock match and the DCC circuit 200 “locks” to the clock signal and the DCC has been adjusted so that an entire period of the input clock signal fits within the delay lengths of the sequentially coupled first and second delay lines 210, 212.

Since the first and second delay lines 210, 212 are adjusted an equal amount by the control signals 216 from the phase detector 214, they remain in a fixed ratio to each other that is dependent on the relative lengths of the delay lines 210, 212 to each other. And since these lengths are equal, the outputs 204 and 206 are 180° out of phase with each other and can be logically combined to set an output clock signal with a 50% duty cycle that is synchronized to the rising or falling edge of the input clock signal. It is noted that other DCC circuit 200 embodiments of the present invention, with differing numbers of delay lines and duty cycle or phase shifted output signals are possible and should be apparent to those skilled in the art with the benefit of the present disclosure.

FIG. 3 details a DLL circuit 300 of an embodiment of the present invention. The DLL circuit 300 of FIG. 3 contains an input buffer 302, a delay line 304, a clock tree delay element 306, an output buffer 314, an input model delay element 308 and a phase detector 310. The input buffer 302 is coupled to the input of the delay line 304 and an input of the phase detector 310. The delay line output is coupled to the clock tree delay 306. The output of the clock tree delay 306 is coupled to the clock output buffer 314 and the input model delay 308. The output of the input model delay 308 is coupled to another input of the phase detector 310. The output of the phase detector 310 is coupled 312 to the delay line 304 controlling its adjustment.

In operation the delay line 300 of the DLL circuit 300 is utilized to generate a replica of an input clock signal 320. The incoming input clock signal 320 is coupled to the input buffer 302 and triggers the start of the propagation of the generated clock signal in the delay line 304. The generated clock signal output from the delay line is then adjusted for known delay of the internal clock distribution tree (the propagation path of the generated clock signal within the integrated circuit) by being propagated through the tree delay 306. The generated clock signal is then coupled to the output buffer 314 to become the synchronized output clock signal 322. The clock signal output from the tree delay 306 is also coupled through the input buffer model 308 (which models and compensates for the delay effect of the input buffer 302 and output delay 314) to the phase detector 310. The phase detector 310 compares the delay adjusted generated clock signal to the following input clock signal and adjusts 312 the length of the delay line (typically by a fixed amount, such as +/− a single delay element on each clock cycle, although adaptive adjustment is also known) to bring the generated clock signal closer to matching the input clock signal frequency. This adjustment of the delay line 304 is iteratively repeated until a match or “lock” of the input clock signal 320 is achieved by the DLL circuit 300. When the circuit containing the DLL circuit 300 is powering up or coming out of a low power mode it is not uncommon for 50 or more clock cycles to pass until the feedback based DLL circuit 300 attains a lock with the external clock, delaying initial operation of or access to the integrated circuit or memory device. Upon power-up or initialization of the DLL circuit 300 typically starts clock generation and iteration at its initial entry point 316 to begin searching for a lock to the input clock signal 320. This initial entry point 316 is typically chosen close to the highest frequency lock of the DLL circuit 300. Lock to the most likely/highest frequency input clock signal is typically achieved in the tLock region of the delay line 304, with the tbuff region designed to accommodate variations due to voltage and temperature differences. The tID region 318 represents the intrinsic delay, the smallest delay/number of delay elements that the delay line 304 is capable of. It is typically because of this tID region 318 and the tree delay, which set a fixed minimum delay/delay elements of the DLL circuit 300 that discontinuities of the DLL circuit's lock curve occur when wrap-around occurs and multiple clock signals are present within the DLL circuit's loop. It is noted that other DLL circuit 300 embodiments of the present invention are possible and should be apparent to those skilled in the art with the benefit of the present disclosure. As stated above, both stepped and graduated delay lines can be graduated up and down and/or have multiple “peaks” and “valleys” in the delay line unit delay profile to provide appropriate coverage for the specific implementation. It is noted, however, that the most common format will include a delay line profile that only increases in unit delay time, such as shown in FIGS. 4 and 5.

As stated above, clock synchronization and skew adjustment circuits of embodiments of the present invention incorporate stepped or graduated delay lines. FIGS. 4 and 5 detail simplified diagrams of a stepped delay line 400 and a graduated delay line 500 of embodiments of the present invention. The graphs of FIGS. 4 and 5 detail delay line delay element profiles, where the number of delays in the delay line increases as one traverses the x-axis and the unit time delay of each delay element is shown on the y-axis of the graph. FIG. 4 details a stepped delay line profile 400 that “steps” 402 with an abrupt transition to a higher unit time delay for each additional delay element in the delay line. It is noted, as stated above, that stepped delay lines are particularly useful in retrofitting existing designs by simply increasing the unit delay of the existing delay elements of the design, typically by increasing the size of delay element transistors. It is also noted that stepped delay line profiles 400 can include one or more “steps” 402 in the unit time delay of each delay element in the delay line profile.

Graduated delay lines 500, such as detailed in FIG. 5, differ from stepped delay lines 400 in that they gradually increase 502 in unit time delay per delay element, such that each following delay element utilized in the delay line has a higher unit time delay than the proceeding delay element. Unlike a stepped delay line, the smoother graduated delay line will produce the maximum unit delay only when needed and will tend to minimize clock synchronization and skew adjustment circuit performance and design issues due to the lack of abrupt transitions in the delay line. It is noted that as the number of steps of the stepped delay line profile 400 are increased in number, the stepped delay line profile becomes more and more that of a graduated delay line profile 500. It is also noted that the aspects of stepped and graduated delay lines can be combined, such that the “step” of a stepped delay line is gradually transitioned through in a graduated manner to allow the delay line to avoid possible instabilities due to an abrupt transition. It is further noted that other delay line profiles 400, 500 of embodiments of the present invention are possible and should be apparent to those skilled in the art with the benefit of the present disclosure.

As also stated above, stepped or graduated delay lines in non-feedback based and non-wrapping feedback based clock synchronization and skew adjustment circuits of embodiments of the present invention, such as SMD circuits or DCC circuits, can begin graduating or stepping to coarser/lower resolution delay elements as soon as the necessary fine resolution is achieved at the critical timings or frequency ranges. This stepping or graduating is typically done according to a resolution de-rating table or curve specified by the designer so as to achieve acceptable resolution at the specified operating frequency ranges of the integrated circuit it is utilized in. It is noted that in many cases, once an integrated circuit or memory device is being operated outside of the frequency range of its design specification the signal timings are out of specification anyways and therefore the accuracy of the frequency lock is, by definition, less critical.

FIG. 6 details a lock point curve for a feedback based clock synchronization and skew adjustment circuit, such as a DLL circuit, of an embodiment of the present invention that can “wrap-around” internally and contain one to N clock cycles internal to the forward path of the circuit. The clock multiples may not necessarily be contained within the DLL delay line, but in the I/O delay (and I/O model delay). This wrap-around capability allows the circuit the advantage of being able to synchronize with a large range of clock frequencies. However, it leads to the above detailed characteristic “sawtooth” lock point curve as the differing amounts of delay lines are required to lock as additional clock periods are wrapped around and added to the internal loop as the input clock frequencies change. In addition, many of these circuits have a minimum amount of delay that is possible in the feedback loop, further adding to the discontinuities when the circuit wraps around.

Because of this “wrap-around” ability of a feedback based clock synchronization and skew adjustment circuit, such as a DLL circuit, a stepped or graduated delay line should begin its graduation or stepping to coarser delay elements for lower frequencies above the delay element peak of (the highest number of delay elements used by) the previous higher frequency lock curve sawtooth, as stated above. This will allow the stepping or graduation to not affect higher frequency sections of the lock curve where more fine resolution is desirable by stepping or graduating too early and inadvertently affecting these higher frequency/higher integer clock cycle saw-tooth sections of the lock curve.

In FIG. 6, a lock curve of feedback based clock synchronization and skew adjustment circuit, such as a DLL circuit, is shown having a y-axis 602 of the number of delay elements inserted in the delay line to lock (the number of delay element signal taps used) and the duration of the corresponding clock period on the x-axis 604. A series of sawtooth peaks (3*tCK, 2*tCK, and 1*tCK) are detailed each having an N*tCK lock slope 612, 610, and 608 on the left side and a discontinuity 614, 616 on the right where clock wrap-around occurs. The minimum amount of delay in the delay line/minimum number of delay elements is shown by the horizontal line “Init to M” 606. Each of the sawtooth peaks (3*tCK, 2*tCK, and 1*tCK) is due to the N number of clock cycles (N*tCK) being held within the DLL circuit loop (the feedback based clock synchronization and skew adjustment circuit) for the circuit to synchronize and lock-in at a given input clock frequency range. It is noted that the number of clock cycles held in the DLL loop increase as the input clock frequency is increased and that the slope (N/td) of each sawtooth lock curve 612, 610, 608 increases in steepness as the clock frequency is increased. It is also noted that the top of each higher frequency sawtooth peak ({N+1}*tCK) is lower than the previous peak (N*tCK). Thus, if graduated or stepped delay lines of embodiments of the present invention are designed to begin graduating or stepping to longer coarser delay elements at delay elements that are above the peak of the next higher frequency sawtooth curve ({N+1}*tCK) in the delay line they will not inadvertently affect the lock ability and resolution at these higher clock speeds.

FIGS. 7A and 7B show lock curves 700, 750 of feedback based clock synchronization and skew adjustment circuit, such as a DLL circuit, where the circuit does not have enough delay elements to lock at slower frequencies. In FIG. 7A, the limit of the number of total delay elements is shown by horizontal line 702. The upper limits of both the 2*tCK lock curve sawtooth 704 and 1*tCK lock curve sawtooth 706 are shown extending beyond this limit 702 in the total delay/total number of delay elements available in the delay line of the circuit. Because of this, the feedback based clock synchronization and skew adjustment circuit will fail to appropriately lock at the frequencies where the lock curve extends beyond the limit 702 in the number of available delay elements in the delay lines, as shown in regions 704 and 706. An example of such a failure is shown in FIG. 7B, where the lock curve 750 of a feedback based clock synchronization and skew adjustment circuit implemented with too little time delay and/or delay elements is shown. It is noted that a peak 752 of the lock curve 750 has been flattened by the lack of available delay elements, as the DLL hangs on the end of the sawtooth curve before transitioning to the next lower curve. It is noted that in some cases, where the DLL has not been designed to hang on the end of the closest matching lock curve 752, it will reset and start searching for a new lock point. In these cases the part loses lock entirely (instead of being simply out of specification) and will not work at all at these frequencies.

FIGS. 8A and 8B show lock curves 800, 850 of feedback based clock synchronization and skew adjustment circuit of an embodiment of the present invention, where the circuit incorporates a graduated or stepped delay line and thus does have enough delay elements to lock at slower frequencies. In FIG. 8A, the start of stepping or graduating of the delay line is shown by horizontal line 808 and the limit of the number of total delay line delay is shown by horizontal line 802 defining a region of coarser unit delay elements 810 in the delay line. The upper limits of both the 2*tCK lock curve sawtooth 804 and 1*tCK lock curve sawtooth 806 are shown extending into this region of increased (stepped or graduated) unit time delay elements 810. Because of this, the feedback based clock synchronization and skew adjustment circuit will not fail and will appropriately lock at the frequencies or voltages where the lock curve extends into the stepped or graduated region of delay elements 810 even though the same or fewer number of delay elements are being utilized by the delay line over a conventional linear delay line profile. It is noted that the region where the delay per delay element stage is increased, tCK has a longer period (slower frequency) and the timing is therefore less critical. Thus the decreased resolution does not have a negative impact on the clock synchronization and skew adjustment circuit lock ability or the performance of the system it is implemented in. The same or decreased number of delay elements also improves clock synchronization and skew adjustment circuit performance by reducing the number of active signals it contains, reducing error amplification and minimizing circuit layout size. It is also noted that the higher frequency 3*tCK sawtooth peak 812 is situated below the start of stepping/graduation 808 and thus the clock synchronization and skew adjustment circuit will not be affected by the stepped or graduated delay line at these higher frequencies and will not lose lock resolution. An example of such a graduation is shown in FIG. 8B, where the lock curve 850 of a feedback based clock synchronization and skew adjustment circuit implemented with a graduated delay line of graduated unit time delay elements is shown.

FIG. 9 details a system 950 of an embodiment of the present invention with a host 954 (such as a memory controller or processor) coupled to a volatile memory device 952, such as a DRAM memory device, of an embodiment of the present invention. It is noted that the memory device of FIG. 9 is only shown as an example, and embodiments of the present invention can include multiple types of other integrated circuits (i.e., a processor, a non-volatile memory device, an ASIC, etc.). It is also noted that in other embodiments of the present invention, the system 950 and/or memory device 952 may include memory of another type, memory bus, and/or memory interface, including, but not limited to, SDRAM, DDR SDRAM, quad data rate (QDR), synchronous graphic DRAM (SGDRAM), DDR II SDRAM, Rambus memory, video RAM (VRAM), static RAM (SRAM), read only memory (ROM), electrically erasable programmable ROM (EEPROM), Flash memory, or other volatile or non-volatile memory with either a synchronous or asynchronous interfaces. Memory devices are well known in the prior art and the following description is intended only to be an overview of their operation and provide an example of their operation with an embodiment of the present invention.

The memory device 952 of FIG. 9 contains an array of memory cells (typically volatile capacitor-based memory cells in a volatile DRAM memory or non-volatile floating gate memory cells in a non-volatile Flash or EEPROM memory) arranged in columns coupled to bit lines as its main memory structure. In the memory device 952 of FIG. 9, address values for the memory are received on the external address bus connections 900. The received address values are stored internal to the memory device 952 and utilized to select the memory cells in the array to read and write data to. In the case of the memory device 952 of FIG. 9, the internal address register is shown split with both a row address register 902 and column address register 904. The address values held in the row address register 902 and column address register 904, select a data word from the memory cell array 906 through the coupled row decode 908 and column decode 910 circuits. The selected data word is read by the coupled sense amplifier circuit 912. Data values for the memory device of FIG. 9 are received and transmitted on the bi-directional data word interface 914. Internal to the memory device 952, data to be written to or from the memory array 906 is held in the Data buffer 916. Control of the memory device 952 of FIG. 9 for both read operations and write operations is actuated by the internal control circuit 918. The control circuit 918 operates in response external control signals received on control signal external interface connections 920 and to internal events. As shown in FIG. 9, all external signal connections to the memory device 952 from the host 954 are coupled to internal I/O buffers on the integrated circuit chip (922, 924, and 926). I/O buffers are shown for each of the major interfaces, address I/O buffers 922, data I/O buffers 926, and control signal I/O buffers 924. Interconnect lines (928, 930, 932, and 934) couple the I/O buffers (922, 924, and 926) to their corresponding active component input circuits that accept the incoming signals. The memory device 952 of FIG. 9 is shown integrating a clock recovery and skew adjustment circuit 936 that incorporates an embodiment of the present invention. An external clock signal 938 is coupled to the clock recovery and skew adjustment circuit 936, such as a SMD or DLL circuit, which recovers and skew adjusts the external clock. The clock recovery and skew adjustment circuit 936 also asserts a DQS strobe signal 940 and an internal clock signal 942, when the memory circuit 952 of FIG. 9 is accessed.

CONCLUSION

Clock synchronization and skew adjustment circuits have been described utilizing varying unit delay elements in the delay lines in either a graduated or a stepped unit time delay arrangement, allowing a reduced circuit implementation and improved lock characteristics. The stepped or graduated delay lines allow for reduced circuit size while maintaining a high resolution by placing a fine delay element granularity at the most critical timings for accurate signal lock. Integrated circuits and memory devices, in accordance with embodiments of the present invention utilize clock synchronization and skew adjustment circuits with either a graduated or a stepped time delay arrangement in their delay lines. These graduated or a stepped unit time delays allow reduction of the number of fine unit delay elements of the delay lines to only those necessary to sense and adjust for the high frequency signals or that portion of the clock signal time period that is critical for operation. The described invention allows clock synchronization and skew adjustment circuits to be implemented in an optimized manner that exhibits a reduced overall circuit size and power consumption, while improving lock characteristics over a wide range of frequencies.

Although specific embodiments have been illustrated and described herein, it will be appreciated by those of ordinary skill in the art that any arrangement, which is calculated to achieve the same purpose, may be substituted for the specific embodiment shown. This application is intended to cover any adaptations or variations of the present invention. Therefore, it is manifestly intended that this invention be limited only by the claims and the equivalents thereof. 

1. A duty cycle corrector (DCC) circuit, comprising: a plurality of delay lines, wherein each delay line contains a plurality of serially-connected delay elements that differ in unit time delay and wherein the plurality of delay lines are sequentially coupled input to output; an input buffer coupled to a first delay line; and a phase detector coupled to a final delay line and the input buffer, wherein the phase detector is configured to compare an input clock signal received by the input buffer and compare it to a delayed clock signal coupled from the final delay line and adjust the plurality of delay lines to synchronize the input clock signal and the delayed clock signal.
 2. The duty cycle corrector (DCC) circuit of claim 1, wherein the plurality of delay lines are selected from the group consisting of stepped delay lines, having a stepped profile in the unit time delay of the serially-connected delay elements of each delay line, and graduated delay lines, having a graduated profile in the unit time delay of the serially-connected delay elements of each delay line.
 3. The duty cycle corrector (DCC) circuit of claim 2, wherein the stepping or graduating of the profile of unit time delay of the serially-connected delay elements is selected according to a resolution de-rating table or a curve specified to achieve acceptable resolution at specified operating frequency ranges.
 4. The duty cycle corrector (DCC) circuit of claim 2, wherein the stepped or graduated profile of the unit time delay of the serially-connected delay elements has multiple peaks and valleys in the profile, where the unit time delay of the serially-connected delay elements are stepped or graduated up and/or down into.
 5. The duty cycle corrector (DCC) circuit of claim 1, wherein the phase detector configured to adjust the plurality of delay lines comprises the phase detector configured to select a number of the plurality of serially-connected delay elements of each of the delay lines to be the same number.
 6. The duty cycle corrector (DCC) circuit of claim 1, wherein each of the plurality of delay lines has a length, and wherein the length of each of the plurality of delay lines is the same.
 7. The duty cycle corrector (DCC) circuit of claim 1, wherein an output of one of the plurality of delay lines is configured to be 180° out of phase with another one of the plurality of delay lines.
 8. A duty cycle corrector (DCC) circuit, comprising: a plurality of delay lines, wherein each delay line contains a plurality of serially-connected delay elements, wherein each delay line utilizes time delays with a graduated time delay arrangement, and wherein the plurality of delay lines are sequentially coupled input to output; a phase detector coupled to a final delay line and an input clock signal, wherein the phase detector is configured to compare the input clock signal and compare it to a delayed clock signal coupled from the final delay line and adjust the plurality of delay lines to synchronize the input clock signal and the delayed clock signal.
 9. The duty cycle corrector (DCC) circuit of claim 8, wherein each delay line utilizes time delays with multiple graduated time delay arrangements, producing at least one valley in a unit delay profile of each delay line.
 10. The duty cycle corrector (DCC) circuit of claim 8, wherein the delayed clock signal coupled from the final delay line is configured to be phase shifted 360° from the input clock signal.
 11. The duty cycle corrector (DCC) circuit of claim 8, wherein each of the plurality of delay lines has a length.
 12. The duty cycle corrector (DCC) circuit of claim 8, wherein the length of each of the plurality of delay lines is the same.
 13. The duty cycle corrector (DCC) circuit of claim 8, wherein the length of one of the plurality of delay lines is different than the length of another one of the plurality of delay lines.
 14. A duty cycle corrector (DCC) circuit, comprising: a plurality of delay lines, wherein each delay line contains a plurality of serially-connected delay elements, wherein each delay line utilizes time delays with a stepped time delay arrangement, and wherein the plurality of delay lines are sequentially coupled input to output; a phase detector coupled to a final delay line and an input clock signal, wherein the phase detector is configured to compare the input clock signal and compare it to a delayed clock signal coupled from the final delay line and adjust the plurality of delay lines to synchronize the input clock signal and the delayed clock signal.
 15. The duty cycle corrector (DCC) circuit of claim 14, wherein each delay line utilizes time delays with multiple stepped time delay arrangements, producing at least one valley in a unit delay profile of each delay line.
 16. The duty cycle corrector (DCC) circuit of claim 14, wherein the delayed clock signal coupled from the final delay line is configured to be phase shifted 360° from the input clock signal.
 17. The duty cycle corrector (DCC) circuit of claim 14, wherein the phase detector configured to adjust the plurality of delay lines comprises the phase detector configured to select a number of the plurality of serially-connected delay elements of each of the delay lines to be the same number.
 18. The duty cycle corrector (DCC) circuit of claim 14, wherein the plurality of delay lines comprises two delay lines, wherein an input of a second delay line of the two delay lines is coupled to the output of a first delay line of the two delay lines, wherein an input of the first delay line is coupled to receive the input clock signal, and wherein the second delay line is the final delay line.
 19. The duty cycle corrector (DCC) circuit of claim 18, wherein the first delay line and the second delay line each have a length.
 20. The duty cycle corrector (DCC) circuit of claim 19, wherein a phase of a delayed clock signal coupled from the first delay line is dependent on the relative lengths of the first and second delay lines to each other.
 21. The duty cycle corrector (DCC) circuit of claim 18, wherein a phase of a delayed clock signal coupled from the first delay line is shifted 180° from a phase of the delayed clock signal coupled from the final delay line. 